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Objective: A genome-wide association study (GWAS) was recently completed in 1120 Pima Indians to 
identify loci that influence BMI. Among the top 100 signals were three variants that mapped within the 
lysophosphatidylglycerol acyltransferase 1 {LPGAT1) gene. LPGAT1 belongs to a large family of 
acyltransferases, which are involved in a variety of biological processes including pathways that regulate 
energy homeostasis and body weight. Therefore LPGAT1 was analyzed as a candidate gene for obesity in 
Pima Indians. 

Design and Methods: Variants (n = 26) located within and adjacent to LPGAT1 including a novel 27bp 
deletion in the 5'-untranslated region identified by sequencing were genotyped in a population-based 
sample of 3,391 full-heritage Pima Indians living in the Gila River Indian Community. Replication of 
selected variants was assessed in a second sample of 3,327 mixed-heritage Native Americans from the 
same community. 

Results: Variants with nominal associations with BMI in each of the two independent samples (tagged by 
rs1 12662024 and rs1 2058008) had associations of P = 1-4 x 10~~ 5 in the combined sample (n = 6718). 
A haplotype that includes the novel 27bp deletion, which does not occur in Caucasians, showed the 
strongest association with BMI in the full-heritage Pima Indians. In vitro functional studies provided 
suggestive evidence that this 27bp deletion may affect transcriptional or posttranscriptional regulation. 
Analysis of LPGAT1 cDNA from human preadipocytes identified an additional exon whose sequence 
could potentially serve as a mitochondrial targeting peptide. 
Conclusions: LPGAT1 is a novel gene that influences BMI in Native Americans. 
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Introduction 

The Pima Indians of Arizona have a high rate of obesity (1) and we 
have recently completed a genome-wide association study (GWAS) 
using the Affymetrix Genome-wide Human SNP Array 6.0 in 1,120 
Pima Indian subjects to identify genes that influence BMI (2). 
Among the top 100 variants associated with BMI, three variants 
were located within a region encompassing the lysophosphatidylgly- 
cerol acyltransferase 1 (LPGAT1) gene (P fs 10~ 4 ref. 2). LPGAT1 
is part of a large group of acyltransferases and belongs to the lyos- 
phosphatidic acid acyltransferase family, which also includes glyc- 
erol-3-phosphate acyltransferases 1-3, lysophosphatidylcholine acyl- 
transferases 1-2 (LPCAT1-2), and various lyosphosphatidic acid 
acyltransferases (3). Acyltransferases catalyze the transfer of an acyl 



group, and are involved in a broad range of metabolic pathways 
including fat storage, food intake, insulin sensitivity, energy homeo- 
stasis (4), and the biosynthesis of lipids (5). Knockout studies in 
mice suggest that several acyltransferases (e.g., GPATs, DGAT1/2, 
CPT1, MGAT2, and AGPAT6) may play a crucial role in body 
weight regulation (6-14). Yang et al. (15) showed that LPGAT1 is 
localized to the endoplasmic reticulum and may be involved in the 
reacylating (remodeling) of lysophosphatidylglycerol (LPG) back to 
phosphatidylglycerol. Hiramine et al. (16) demonstrated that 
LPGAT1 can also display monoacylglycerol acyltransferase activity. 
MGATs are enzymes involved in the synthesis of diacylglycerol — a 
precursor of triacylglycerol. They also observed that targeted knock- 
down of LPGAT1 in the liver of dh/dh mice resulted in a slight 
decrease in body weight. Based on the associations from our GWAS 



Phoenix Epidemiology and Clinical Research Branch, National Institute of Diabetes and Digestive and Kidney Disease, National Institutes of Health, Phoenix, 
Arizona, USA. Correspondence: Leslie J. Baier (lbaier@phx.niddk.nih.gov) 

Disclosure: The authors declared no conflict of interest.See the online ICMJE Conflict of Interest Forms for this article. 
Additional Supporting Information may be found in the online version of this article. 

Received: 1 5 September 201 1 Accepted: 1 7 May 201 2 First published online by Nature Publishing Group on behalf of The Obesity Society 26 July 201 2. 
doi:1 0.1 038/oby.201 2.1 61 



www.obesityjournal.org 



Obesity I VOLUME 21 I NUMBER 1 I JANUARY 2013 



193 



Obesity 



Role of LPGAT1 in Influencing BMI Traurig et al. 



with BMI and the potential role of LPGAT1 in lipid metabolism and 
body weight regulation, LPGAT1 was analyzed as a candidate gene 
for obesity in the Pima Indians of Arizona. 

Methods and Procedures 
Subjects and phenotypes 

Characteristics for the individuals and specific exams included in 
this study are provided in Supplementary Table SI online. The sub- 
jects are part of a longitudinal study (1965-2004) of the etiology of 
type 2 diabetes in the Gila River Indian Community in Central Ari- 
zona (17). Subjects were invited to participate in a health exam ev- 
ery 2 years, which included a 75-g oral glucose tolerance test to 
determine diabetes status along with measures of height and weight 
to determine BMI. Only individuals who had at least one measure 
of BMI at >15 years of age were included in this study and exams 
at which the subject was pregnant were excluded. The full-heritage 
Pima Indian population-based sample consisted of all individuals 
whose heritage was reported as full Pima and/or Tohono O'odham 
(a closely related tribe) (n = 3,391). The mixed-heritage population- 
based sample consisted of all of the remaining individuals who had 
been studied in the Gila River Indian Community (n = 3327). Their 
reported heritage on average was one-half Pima and one-third Ameri- 
can Indian, which may include other tribes. Since many of these longi- 
tudinally studied subjects had multiple exams in which BMI was 
measured (Supplementary Table SI online), we analyzed both the 
maximum BMI recorded from any single exam and repeated measures 
of BMI, which used BMI data from all exams. A subset of these sub- 
jects (« = 555) was additionally studied as inpatients in our Clinical 
Research Center when they were nondiabetic. These individuals under- 
went tests to measure quantitative traits related to obesity such as per- 
cent body fat. Some of these subjects took part in multiple exams. In 
this study, we analyzed both percent body fat measured at a single 
exam (the exam when the subject was the youngest was selected) as 
well as repeated measures of percent body fat which included meas- 
ures from all visits (Supplementary Table SI online). Body composi- 
tion was estimated by underwater weighing until January 1996, and by 
dual energy X-ray absorptiometry (DPX-1; Lunar Radiation, Madison, 
WI) thereafter. A conversion equation derived from comparative anal- 
yses was used to make estimates of body composition equivalent 
between the two methods (18). All of the subjects provided written 
informed consent before participation in the study. This study was 
approved by the Institutional Review Board of the National Institute 
of Diabetes and Digestive and Kidney Diseases (NIDDK). 

Sequencing and genotyping 

To identify potential novel variants in LPGAT1 , all of the coding 
exons, exon-intron boundaries, 5'- and 3'-untranslated regions 
(UTRs) and the promoter regions of LPGAT1 /ucOOlhiu.l (isoform 
A) and LPGATl/ucOOlhiv.l (isoform B) were sequenced in 24 unre- 
lated (not related by the first degree) full-heritage Pima Indians. To 
determine whether the novel 5'-UTR 27bp deletion (LPAGT1-4) is 
unique to Native Americans, a DNA panel (Human Random Control 
Panel-5; Sigma- Aldrich, St. Louis, MO) consisting of 96 randomly 
selected unrelated Caucasian subjects was also sequenced. Sequenc- 
ing was done using a Big Dye terminator kit (Applied Biosystems, 
Foster City, CA) on an automated DNA capillary sequencer (model 
3730; Applied Biosystems). Genotyping was done by using either 
the SNPlex genotyping System 48-plex (Applied Biosystems) on an 



automated DNA capillary sequencer (model 3730; Applied Biosys- 
tems), the Illumina BeadXpress System (Illumina, San Diego, CA), 
or by allelic discrimination polymerase chain reaction (PCR) 
(Assays-on-Demand SNP Genotyping products; Applied Biosys- 
tems). Genotyping data underwent quality control assessment, which 
required a successful call rate of >85% of all samples, a deviation 
from Hardy-Weinberg equilibrium of P > 0.001, and a discrepancy 
rate of <2.5% for blind duplicate samples (330 and 100 blind dupli- 
cated samples in the full-heritage Pima Indian and mixed-heritage 
population-based samples, respectively). 

Statistical analyses 

Statistical analyses were performed using the statistical analysis sys- 
tem of the SAS Institute (Cary, NC). For continuous variables, linear 
regression models were used to analyze the association between 
BMI and genotype with adjustments for covariates including age, 
sex, and birth year. The generalized estimating equation procedure 
was used to account for family membership since some of the sub- 
jects were siblings. The logarithmic transformation of BMI was used 
in these analyses to reduce skewness. In the mixed-heritage Native 
American sample, the individual estimate of European admixture 
was also used as a covariate. These estimates were derived from 32 
markers with large differences in allele frequency between popula- 
tions (19) using the analytical method described in Hanis et a/.(20). 
A combined test of association for the full-heritage Pima Indian and 
mixed-heritage Native American samples was conducted by the 
inverse variance method (21). For the repeated BMI and percent 
body fat measures, the SAS MIXED procedure which fits a linear 
mixed effects model using a maximum likelihood method was used 
for the association analyses (described in ref. 2). Six haplotypes 
could be defined by the five variants shown in Table 1. Haplotype 
analyses were conducted using a modification of the zero-recombi- 
nant haplotyping method (22), as previously described (23). In brief, 
the MLINK program was used to assign haplotype probabilities for 
each individual based on their genotypes, the genotypes of their rel- 
atives, and the population haplotype frequencies. These probabilities 
were then used in regression models, in a fashion analogous to the 
analysis of single variants, to assess the association with each indi- 
vidual haplotype in comparison with all others. Linkage disequili- 
brium (LD: D' and r 2 ) was analyzed using the Haploview program 
(Haploview http://www.broad.mit.edu/mpg/haploview/). Mixed 
effects models were used to assess an additive model of association 
between the LPGAT1 haplotypes and LPGAT1 gene expression; age, 
sex, and Pima heritage were included as fixed effects with related- 
ness treated as a random effect. 

Plasmid constructs 

Genomic fragments with or without the rs 112662024 G insertion 
located in the 3'-UTR of LPGAT1 were amplified by PCR using 
DNA from a known heterozygous individual as a template. The 
fragments (402 bps, chrl:209,989,461-209,989,862, NCBI build 
36.1, UCSC Genome Browser) were cloned into the pGL3-promoter 
vector (Promega, Madison, WI) at the Xbal and Fsel sites located at 
the 3' end of the luciferase gene, which will allow the fragments to 
be transcribed as the 3'-UTR of the gene. To examine whether the 
27bp deletion (LPGAT1-4) located in exon 1 (encodes the 5'-UTR) 
of LPGATl/ucOOlhiv.l (isoform B) may affect transcription of iso- 
form A or isoform B, genomic fragments with and without the dele- 
tion were amplified by PCR using genomic DNA from a known 
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TABLE 1 Association results for the five tag variants with BMI in the full-heritage Pima Indian, mixed-heritage Native 
American, and combined sets 



Mean BMI (kg/m 2 ) 



Variant 


Sample 8 


Analysis 


M/m 


mAF 


M/M 


M/m 


m/m 


Beta 


pb 


rs77541843 


Full-heritage 


Max BMI 
Repeated BMI 


G/C 


0.33 


37.3 


37.3 


37.9 


-0.007 
-0.006 


0.25 
0.28 




Mixed-heritage 


Max BMI 
Repeated BMI 




0.22 


34.4 


35.5 


35.9 


-0.014 
-0.019 


0.06 

9 x 1 0 :i 




Combined 


Max BMI 
Repeated BMI 




0.28 


35.6 


36.5 


37.3 


-0.009 
-0.011 


0.05 
0.01 


rs1 12662024 


Full-heritage 


Max BMI 
Repeated BMI 


-G/+G 


0.22 


38.0 


36.7 


36.1 


0.027 
0.023 


8x 10 fi 
8 x 1 0 s 




Mixed-heritage 


Max BMI 
Repeated BMI 




0.23 


35.1 


34.5 


34.5 


0.015 
0.014 


0.04 
0.04 




Combined 


Max BMI 
Repeated BMI 




0.23 


36.6 


35.6 


35.3 


0.021 
0.020 


4 x 10 b 
1 x 10 6 


rs4951552 


Full-heritage 


Max BMI 
Repeated BMI 


C/A 


0.23 


37.9 


36.5 


35.7 


0.027 
0.023 


8x 10 5 
1 x 1 0- J 




Mixed-heritage 


Max BMI 
Repeated BMI 




0.33 


35.5 


34.6 


33.8 


0.012 
0.011 


0.06 
0.08 




Combined 


Max BMI 
Repeated BMI 




0.28 


36.8 


35.4 


34.4 


0.018 
0.017 


1 x 10'' 
8x 10 5 


LPGAT1-4 


Full-heritage 


Max BMI 
Repeated BMI 


wt/-27 bp 


0.20 


37.8 


36.8 


36.0 


0.023 
0.020 


7 x 1 0-" 
1 x 1 0 3 




Mixed-heritage 


Max BMI 
Repeated BMI 




0.17 


34.7 


35.3 


34.9 


-0.003 
-0.004 


0.65 
0.63 




Combined 


Max BMI 
Repeated BMI 




0.19 


36.2 


36.1 


35.5 


0.011 
0.010 


0.04 
0.03 


rs1 2058008 


Full-heritage 


Max BMI 
Repeated BMI 


G/A 


0.24 


38.0 


36.6 


36.4 


0.024 
0.021 


2x10'' 
3 x 1 0-* 




Mixed-heritage 


Max BMI 
Repeated BMI 




0.33 


35.5 


34.6 


33.6 


0.016 
0.014 


0.01 
0.02 




Combined 


Max BMI 
Repeated BMI 




0.29 


36.9 


35.5 


34.6 


0.020 
0.018 


3 x 1 0- 5 
3 x 1 0" 5 



Beta estimates and P values are given for both the maximum BMI recorded from a single exam (max BMI) and all BMIs recorded from the longitudinal exams (repeated 
BMI). M, major allele; m, minor allele; mAF, minor allele frequency. 

•'Combined, full-heritage Pima Indians + mixed-heritage Native Americans. "P values for maximum BMI were adjusted for age, sex, birth year, family membership, and 
Native American heritage. For the repeated BMI measures, P values were adjusted for age, sex, diabetes status, diabetes duration, birth year, and Native American 
heritage. Position and sequence information for the novel variant LPGAT1-4 (27 bp deletion) are shown in Supplementary Figure 1a,b online. 



heterozygous individual. The inserts (isoform A: 781 and 754 bps, 
chrl:210,069,986-210,070,766; isoform B: 600 and 573 bps, 
chrl:210,070,530-210,071,129, NCBI build 36.1, UCSC Genome 
Browser) were cloned into the promoterless pGL4.17 vector (Prom- 
ega) at the multiple cloning region. To examine whether the 27bp 
deletion may affect gene expression posttranscriptionally, the 5'- 
UTRs of LPGAT1 isoform B (RNA accession, NM 014873.1; 



UCSC ID, ucOOlhiv.l) and LPGATl.cApr07 (AceView gene mod- 
els) with and without the deletion were cloned into the pGL3-pro- 
moter vector at the Hindlll and Ncol sites located at the 3' end of 
the SV40 promoter and 5' end of the luciferase gene, respectively. 
Inserts were amplified by PCR using human cDNA from an individ- 
ual heterozygous for this variant. All constructs were verified by 
sequencing. 
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TABLE 2 Haplotype analysis for the LPGAT1 variants with BMI 



Full-heritage Pima Indians Mixed-heritage Native Americans Combined 

Mean BMI (kg/m 2 ) Mean BMI (kg/m 2 ) 



Haplotype a Freq 0 copies 1 copy 2 copies P° Freq 0 copies 1 copy 2 copies P" P b 



1. 


G 


+G 


A 


wt 


A 


0.026 


36.5 


35.7 




0.26 


0.065 


33.9 


31.6 




3.8 x 10" 6 


6.5 x 10" 6 


2. 


G 


■ G 


A 


-27bp 


A 


0.185 


36.8 


35.7 


35.3 


6.5 x 10 A 


0.165 


33.6 


33.8 


33.3 


0.60 


0.04 


3. 


G 


-G 


A 


wt 


A 


0.008 


36.4 


37.1 


32.7 


0.75 


0.095 


33.7 


33.8 


31.9 


0.90 


0.95 


4. 


G 


-G 


c 


wl 


A 


0.018 


36.4 


36.1 


32.9 


0.49 


0.023 


33.6 


34.6 




0.39 


0.99 


5. 


G 


- G 


c 


wt 


G 


0.423 


36.2 


36.2 


37.3 


0.03 


0.438 


33.7 


33.6 


33.7 


0.98 


0.15 


6. 


C 


-G 


c 


wt 


G 


0.333 


36.2 


36.5 


36.8 


0.25 


0.211 


33.4 


33.9 


34.2 


0.11 


0.06 



Freq, frequencies for the haplotypes; wt, wild-type (no deletion). 

"The haplotypes are composed of the following variants in this order: rs77541 843 (G/C), rs1 1 2662024 (-G/+G), rs4951 552 (C/A), LPGAT1 -4 (wt/-27 bp), and rs1 2058008 
(G/A). b P values were adjusted for age, sex, birth year, family membership, and Pima heritage. 



Cell transfections and luciferase assays 

HepG2, Caco-2, 3T3-L1, and G8 myoblasts were all purchased from 
American Type Culture Collection (ATCC, Manassas, VA). The 
HepG2 and Caco-2 cells were maintained in Eagle's Minimum 
Essential Medium (ATCC) supplemented with 10 and 20% fetal bo- 
vine serum, respectively (ATCC). The 3T3-L1 cells were cultured in 
Dulbecco's Modified Eagle's Medium (ATCC) supplemented with 
10% calf bovine serum (ATCC), whereas the G8 mouse myoblasts 
were cultured in Dulbecco's Modified Eagle's Medium (ATCC) sup- 
plemented with 10% fetal bovine serum (ATCC) and 10% horse se- 
rum (ATCC). All transfections were done with either Lipofectamine 
2000 or Lipofectamine LTX and PLUS Reagents (Invitrogen, Carls- 
bad, CA) following the manufacturer's instructions using serum and 
antibiotic free medium. To control for transfection efficiency, co- 
transfections were done using the pGh4J4[hRluc/TK] plasmid 
(Promega) and normalized to the Renilla luciferase expression. 
Forty-eight hours after transfection the cells were washed with phos- 
phate buffer solution and cell lysates were prepared using the lysis 
buffer from the Dual-Luciferase Reporter Assay System (Promega). 
Luciferase activities were measured using the Dual-Luciferase Re- 
porter Assay System. Luciferase activity is expressed as the relative 
activity and is shown as the means ± s.d. for six replicates. 



Isolation of preadipocyte cDNA and screening for 
the 27bp 5'-UTR deletion 

Preadipocytes were previously obtained from subcutaneous abdomi- 
nal adipose tissue biopsies as described (24,25) and total RNA was 
extracted using an RNeasy Mini Kit (Qiagen, Valencia, CA). To 
remove any residual DNA, the purified RNA was treated with 
DNAse using RNAse-free DNAse Set (Qiagen). First-strand cDNA 
was synthesized from preadipocyte total RNA using a BD Advant- 
age RT-for-PCR Kit (BD Bioscience/Clontech, Mountain View, CA) 
following the manufacturer's instructions. Primers located in exons 
1 and 2 of isoform B (RNA accession, NM_014873.1; UCSC ID, 
LPGATl/ucOOlhiv.l) were designed to screen the preadipocyte 
cDNA for the presence of the 27bp deletion. Primer sequences were 
as follows: forward, 5'-GCCCTT GGGGACCGAGTCTC-3' and 
reverse, 5'-TAGCAACCAGGTTGTTGACGACCA-3'. Since exons 1 
and 2 are located in a GC rich region, a GC PCR kit (BD Bio- 



science/Clontech) was used for the PCR reactions. PCR amplified 
products were separated by gel electrophoresis using a 2% super 
fine resolution agarose gel (Amresco, Solon, OH), gel purified using 
a QIAquick gel extraction kit (Qiagen), and verified by sequencing. 

Results 

Identification of novel variants and association 
analysis 

Sequencing of both LPGAT1 isoforms A and B in 24 unrelated 
Pima Indians detected several variants previously identified in other 
ethnic groups (i.e., variants found in dbSNPs build 135, data not 
shown) and four novel variants including one in exon 1 (5'-UTR) of 
isoform A, two in the promoter region of isoform A, and a 27bp de- 
letion (LPGAT1-4) in exon 1 (5'-UTR) of isoform B (locations and 
sequence information for the four novel variants is provided in Sup- 
plementary Figure Sla,b online). Several variants (n = 26) including 
7 detected by sequencing (the 27bp deletion and 6 previously identi- 
fied variants), 9 from the initial GWAS, and 10 tag SNPs from the 
CHB HapMap (minor allele frequency >0.1 and a pairwise r 2 > 
0.8) spanning a 197-kb region encompassing the LPGAT1 gene were 
selected for genotyping in a population-based sample of 3,391 full- 
heritage Pima Indians. The LD patterns (£>' and r 2 ) for the 26 variants 
are shown in Supplementary Figure S2 online. Several variants located 
in a LD block that spans LPGAT1 and its upstream 5' region were 
associated with BMI in the full-heritage Pima sample (Supplementary 
Table S2 online). Polymorphisms within this block with allele frequen- 
cies >1% could be tagged by five variants (Supplementary Figure S2 
online). Four of these five tag variants were associated with BMI in 
the full-heritage Pima Indian sample (Table 1). Defining BMI in these 
longitudinally studied subjects as either maximum BMI (the maximum 
BMI on record) or repeated BMI measures (BMI from all longitudinal 
exams, ranging from 1-20 exams per individual) provided similar levels 
of associations (Table 1). Among the 555 nondiabetic subjects who had 
been characterized for traits related to obesity, the 4 variants showing 
the strongest evidence for association with BMI in the full-heritage 
Pima sample also had the strongest associations with percent body fat 
and repeated percent body fat measure (Supplementary Table S3 
online). Genotyping of the five tag variants in a second sample of 
mixed-heritage Native Americans (n = 3327) showed that four of the 
variants had a nominal association with BMI (Table 1). The strongest 
independent replications were with rsl 1266202 and rsl2058008, and 
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TABLE 3 Haplotype analysis for the LPGAT1 variants with BMI 













Muscle (n 


= 166) 


Adipose (n 


= 156) 


Haplotype 3 








P 


pb 




pb 


1. G 


+G 


A 


vVt 


A 


0.13 


0.63 


0.09 


0.77 


2. G 


+G 


A 


-27 bp 


A 


-0.54 2 


x 10 5 


-0.19 


0.17 


3. G 


-G 


A 


wt 


A 


0.20 


0.63 


0.22 


0.64 


4. G 


-G 


C 


wl 


A 


0.35 


0.44 


-0.48 


0.28 


5. G 


G 


C 


wt 


G 


0.11 


0.30 


0.16 


0.16 


6. C 


-G 


C 


wt 


G 


0.22 


0.06 


-0.07 


0.57 



wt, wild-type (no deletion). 

a The haplotypes are composed of the following variants in this order: rs77541 843 
(G/C), rs1 12662024 (-G/+G), rs4951552 (C/A), LPGAT1-4 (wt/-27bp), and 
rs1 2058008 (G/A). h P values were adjusted for age, sex, and Pima heritage. 



combining the full-heritage and mixed-heritage population sets provided 
the strongest evidence for association with BMI (Table 1). However, 
rs77541843 which had no association with BMI in the full-heritage 
Pima sample had the strongest association in the mixed-heritage sam- 
ple, and LPGAT1-4 (27bp deletion) was associated with BMI in the 
full-heritage sample, but not in the mixed-heritage sample (Table 1). 

As would be predicted from the single variant association data, the 
LD (r 2 ) between the five variants in Table 1 differed in the full-her- 
itage Pima Indians compared with the mixed-heritage Native Ameri- 
cans due to ethnic differences in allele frequencies (Supplementary 
Figure S3 online). In particular, the 27bp deletion (LPGAT1-4) 
located in the 5'-UTR has a 20% frequency in full-heritage Pima 
Indians, but is not annotated in any public database, and sequencing 
of this region in 96 unrelated Caucasians did not find the 27bp 
deletion in any of the individuals. These five variants produce six 
haplotypes (Table 2). Haplotype 2 had the strongest association with 
BMI in the full-heritage Pima Indian sample, whereas haplotype 1 
had the strongest association in the mixed-heritage sample. Haplo- 
types 1 and 2 differ only at the 27bp deletion (Table 2). 

To determine whether any of the haplotypes could have functional 
consequences, we looked for an association between each haplotype 
and in vivo LPGAT1 gene expression in muscle and adipose tissues. 
Using genome-wide expression data that had previously been obtained 



from full-heritage Pima Indians (26), we found that only haplotype 2 
was associated with LPGAT1 expression levels in muscle (Table 3). 
Individuals with one copy of haplotype 2 had lower LPGAT1 expres- 
sion than individuals with no copies of the haplotype, and individuals 
with two copies of haplotype 2 had the lowest levels of expression 
(Figure la). Although a similar trend of LPGAT1 gene expression 
based on copies of haplotype 2 was observed in adipose tissue, the 
association was not significant (Figure lb). 

Identification of an additional exon in LPGAT1 

To verify the presence of the 27bp deletion (LPGAT1-4) in the 5'- 
UTR of the LPGAT1 isoform B mRNA transcript, subcutaneous pre- 
adipocyte cDNA previously isolated from 10 Pima Indian subjects 
was PCR amplified using primers located in exons 1 and 2 of iso- 
form B and the PCR products were analyzed by gel electrophoresis. 
As expected, individuals homozygous for the 27bp deletion had a 
single amplified band running slightly lower than the band observed 
for individuals homozygous for the nondeleted mRNA transcript, 
whereas heterozygous individuals had both bands (Figure 2a). How- 
ever, all of the PCR products were 148 bases larger than anticipated. 
Sequencing and alignment of the PCR products to the human 
genome (Human BLAT search, UCSC Genome Browser, Human 
March 2006 assembly) revealed that this cDNA matches a partial 
LPGAT1 transcript (LPGATl.cApr07) shown in the AceView gene 
models track, UCSC Genome Browser which differs from the 
LPGAT1 isoform B in that it has an additional exon (148 bps) 
located between exons 1 and 2 (Figure 2b). Part of this alternative 
exon may be translated (Figure 2b, thicker region and Figure 2c, 
single underlined region) and the subcellular localization prediction 
programs TargetP 1.1 (27) and iPSORT (28) both predicted that the 
alternative N-terminal sequence could potentially serve as a mito- 
chondrial targeting peptide (data not shown). For comparison, the 
N-terminal sequences for isoforms A and B (Figure 2b, exon 2 for 
both isoforms and Figure 2c, double underlined region) were also 
queried and both programs predicted that this region may function 
as an endoplasmic reticulum signal peptide (data not shown). 

Functional studies for the rs1 12662024 and 
LPGAT1 -4 mutations 

Rs 112662024 and LPGAT1-4 (27bp deletion) were selected for fur- 
ther functional analysis based on their locations within the gene. The 
rs 112662024 (G insertion) variant is located in a highly conserved 
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FIGURE 1 LPGAT1 expression levels based on the copy number of haplotype 2 in (a) muscle and (b) adipose. Means 
are indicated by the horizontal lines. P values were adjusted for age, sex, and Pima heritage. LPGAT1 , lysophosphati- 
dylglycerol acyltransferase 1 . 



www.obesityjournal.org 



Obesity I VOLUME 21 I NUMBER 1 I JANUARY 2013 



197 



Obesity 



Role of LPGAT1 in Influencing BMI Traurig et al. 



^ w ****** m 

V W ^ #r 9 V * 




Gt: wt/ wt/ wt/ wt/ wt/ wt/ wt/ 
-27 bp wt wt -27 bp -27 bp -27 bp wt 



-27 bp/ wt/ wt/ 
-27 bp -27 bp wt 



aligned sequence 



H h- 



LPGAT1.cApr07 



Exon 3 



Exon 2 



Exon 1 



Isoform A 



Exon 2 



Exon 1 



Isoform B 



Exon 2 



Exon 1 



MTATPRRGTPAPALRSRPPSAPAEKDRTESSVRMAITLEEAPWLGWLLV 
KALMRFA FMVVNNLVAIPSYICYVIILQPLRVLDSKRFWYIEGIMYKWL 
LGMVASWGWYAGYTVMEWGEDIKAVSKDEAVMLVNHQATGDVCTLMMCL 
QDKGLVVAQMMWLMDHIFKYTNFGIVSLVHGDFFIRQGRSYRDQQLLLL 
KKHLENNYRSRDRKWIVLFPEGGFLRK 



FIGURE 2 Identification of an alternative exon 2 in LPGAT1 . (a) Gel image showing the results for the PCR reactions. 
PCR reactions were done using subcutaneous preadipocyte cDNA isolated from 10 Pima Indian subjects, (b) Sche- 
matic showing the alignment of the sequenced cDNA PCR products (aligned sequence) with the first three exons of 
the LPGAT1 .cApr07 partial transcript (AceView Gene Models track, UCSC Genome Browser). The alternative exon 
identified by sequencing, which is also present in the LPGAT1 .cApr07 transcript is highlighted by the gray shaded rec- 
tangle. For the LPGAT1 .cApr07, isoform A, and isoform B transcripts the thicker bars indicate regions that are trans- 
lated and the thinner bars indicate regions that are not translated. Horizontal arrows in exons 1 and 2 of isoform B 
show the locations of the primers used to screen the preadipocyte cDNA. (c) Amino acid sequences for the translated 
exons shown in (b). Single underline indicates the putative mitochondrial targeting peptide (exon 2 of LPGAT1 .cApr07). 
The signal peptide was predicted using TargetP 1.1(25) and iPSORT(26) subcellular localization prediction programs. 
Double underline indicates the endoplasmic reticulum signal peptide for both isoforms A(15) and B. Gt, genotype; L, 
DNA ladder; LPGAT1 , lysophosphatidylglycerol acyltransferase 1 ; wt, wild-type (no deletion). 



region in the 3'-UTR 159 bases from the stop codon and 79 bases 
from a predicted miRNA binding site (Supplementary Figure S4a,b 
online) whereas the 27bp deletion is located in the 5'-UTR (exon 1) of 
isoform B in a region with strong regulatory potential (Supplementary 
Figure S4b online). To examine whether rs 112662024 could possibly 
affect posttranscriptional regulation (e.g., mRNA stability, translation 
efficiency), genomic fragments with and without the G insertion (+G 
and — G, respectively; Supplementary Figure S5a online) were cloned 
into the pGL3-promoter vector at the 3' end of the luciferase gene. 
Since 14 additional variants were also indentified in the 3'-UTR, only 
the first 402 bps were cloned in order to avoid those other variants. 
The two constructs (rsl 12662024~ G and rsl 12662024+°) and a no 
insert control (empty pGL3-promoter vector, pGL3-PV) were then co- 



transfected with pGL4.74[M/i/f/TK] into HepG2, Caco-2, 3T3-L1, 
and G8 myoblast cell lines and luciferase activities were measured 48 
h posttransfection. There was no significant difference in luciferase 
activity between the rsl 12662024~ G and rsl 12662024 +G constructs in 
any of the four cell lines tested (Supplementary Figure S5b-e online). 
However, in all four cell lines there was a significant decrease in lucif- 
erase activity for both constructs containing the rsl 12662024 inserts 
as compared with the pGL3-promoter vector control (no insert) (Sup- 
plementary Figure S5b-e online) suggesting that there may be a poten- 
tial regulatory element located within this 402 bp region. 

To determine whether the 27bp deletion (LPGAT1-4) affected a puta- 
tive distal promoter element in isoform A or a promoter regulatory 
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FIGURE 3 Functional studies to examine whether the 27 bp deletion affects transcriptional regulation, (a) Diagram 
showing the LPGAT1 -4 wt and LPGAT1 -4~ 2 7 bp regions (dashed lines) that were cloned into the promoterless pGL4.17 
vector. Relative luciferase activities for the LPGAT1 -4 wt and LPGAT1 -4~ 27 bp isoform A and isoform B promoter con- 
structs in (b) HepG2, (c) CaCo-2, (d) 3T3-L1, and (e) G8 myoblast cell lines. Transfections were done with equal 
amounts of plasmids. The constructs were co-transfected with pGL4.74[r)fi/ucATK] and luciferase levels are expressed 
as the relative activities and are shown as the means ± s.d. for six replicates. LPGAT1, lysophosphatidylglycerol acyl- 
transferase 1 ; pGL3-BV, pGL3-basic vector no insert control; wt, wild-type (no deletion). 



element in the 5'-UTR of isoform B, two sets of inserts without (wild- 
type) and with (— 27bp) the deletion (Figure 3a) were cloned into the 
promoterless pGL4.17 vector. The constructs, isoform A wt , isoform 
A _27bp , isoform B wt , and isoform B~ 27bp , were co-transfected with 
pGL4.74[/!fl/wc/TK] into HepG2, Caco-2, 3T3-L1, and G8 myoblast 
cell lines and luciferase activities were measured 48 h posttransfection. 
Both sets of LPGATl-4 wt and LPGATl-4~ 27bp constructs had higher 
luciferase activities compared with the no insert control (pGL4.17) in 
all four cell lines (Figure 3b-e). However, there were only modest dif- 
ferences in luciferase activities between the isoform A wt and isoform 
A _27bp constructs in the Caco-2 and 3T3-L1 cells (Figure 3c,d), indicat- 
ing that the 27bp deleted region is unlikely to significantly influence 
LPGAT1 isoform A or isoform B promoter activity in these cell lines. 

Since the 27bp deletion is located in the 5'-UTRs of isoform B and 
LPGATl.cApr07 (Figure 4a), the 5'-UTRs of both mRNAs without 



(wild-type) and with the deletion (— 27bp) were cloned into the 
pGL3-promoter vector between the SV40 promoter and start codon 
of the luciferase gene to examine whether the deletion may poten- 
tially affect posttranscriptional regulation. The four constructs 
(LPGAT1.5UTR-B™ 1 , LPGAT1.5UTR-B~ 27bp , LPGATl.cApr07 wt , 
and LPGAT1. cApr07~ 27bp ) were co-transfected with 
pGL4.74[/tfl/«c/TK] into HepG2, Caco-2, 3T3-L1, and G8 myoblast 
cell lines. For the HepG2, Caco-2, and G8 myoblast cell lines, the 
luciferase activities for the constructs containing the 5'-UTRs were 
higher compared with the empty vector control (pGL3-PV), and 
there was a moderate but statistically significant decrease in lucifer- 
ase activities for the two 5'-UTR~ 27bp constructs compared with the 
5'-UTR wt constructs (Figure 4b,c.e). For the 3T3-L1 cells, there was 
not an increase in luciferase activities for the 5'-UTR constructs 
compared with the pGL3-PV control, however, similar to the other 
three cell lines, there was still a modest but statistically significant 
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FIGURE 4 Functional studies to examine whether the 27 bp deletion affects posttranscriptional regulation, (a) Diagram 
showing the two LPGAT1 5'-UTRs (isoform B and LPGAT1 .cApr 07) that were cloned into the pGL3-promoter vector. 
Open boxes indicate the regions that were cloned. Arrows indicate the location of the 27 bp deletion. Relative lucifer- 
ase activities for the LPGAT1 .5UTR-B™ 1 , LPGAT1 .5UTR-B- 27 bp , LPGAT1 .cApr07 wt , and LPGAT1 .cApr07- 27 bp con- 
structs in (b) HepG2, (c) Caco-2, (d) 3T3-L1 , and (e) G8 myoblast cell lines. Transfections were performed with equal 
amounts of plasmids. The constructs were co-transfected with pGL4.74[hR/uc/7K] and luciferase assays were done in 
replicates of six and the results are shown as means ± s.d. LPGAT1 , lysophosphatidylglycerol acyltransferase 1 ; 
pGL3-PV, pGL3-promoter vector no insert control; UTR, untranslated region; wt, wild-type (no deletion). 



decrease in luciferase levels for the 5'-UTR constructs containing 
the 27bp deletion (Figure 4c). These data suggest that the 27bp dele- 
tion may affect the mRNA for both LPGAT1 isoform B and 
LPGATl.cApr07. The free energies and RNA secondary structures 
for the full length LPGAT1 isoform B and partial LPGATl.cApr07 
transcripts with and without the 27bp deletion were predicted using 
the RNAfold Webserver (29,30). For both sets of transcripts, there 
was a 17.5 unit change in the minimum free energy predictions 
(LPGAT1 isoform B wt , -1893.97 kcal/mol; LPGAT1 isoform 
B~ 27bp , -1876.47, and LPGATl.cApr07 wt , -401.72 kcal/mol; 



LPGATl.cApr07~ 27bp , -384.22 kcal/mol); and, comparison of their 
predicted secondary structures show that the 27bp deletion results in 
the removal of two stem-loop structures attached to a larger loop, 
which is replaced by two small loops separated by one pair of nucle- 
otides (Figure 5). 

Discussion 

A GWAS for BMI in Pima Indians led us to analyze LPGAT1 as a 
candidate gene for human obesity. Variants in this gene were 
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FIGURE 5 Predicted RNA secondary structures for both normal (no deletion) and 
mutant (27 bp deletion) transcripts. The 27 bp deletion eliminated a double-stem 
loop structure and a smaller single doublelooped region formed (arrows). Only the 
region that was altered is shown. The alteration was the same for both sets of tran- 
scripts (isoform B and LPGATLcApr 07). LPGAT1, lysophosphatidylglycerol acyl- 
transferase 1 ; UTR, untranslated region. 



nominally, but reproducibly associated with maximum BMI and 
repeated BMI measures in two nonoverlapping population-based 
samples from the Gila River Indian Community. As might be 
expected, these variants are also associated with percent body fat 
within a subset of individuals who have measures of body composi- 
tion. We are not aware of any other published GWASs or large 
meta-analyses reporting an association between this locus and obe- 
sity. Our GWAS lead variant, rs4951552, whose major/minor alleles 
are opposite in Native Americans compared with Caucasians, also 
had a nominal association with both BMI and percent body fat 
(additive P = 0.03) in a GWAS of ~ 1,000 Caucasian subjects (P. 
Kovacs, personal communication), but the obesity risk alleles were 
also flipped. However, the finding of a significant association 
between rs4951552 and BMI in Caucasians might not be expected 
because rs4951552 is unlikely to be a functional variant and the LD 
in this region is much higher in Pima Indians compared with Cauca- 
sians. In addition, our observation that two haplotypes, which only 
differ by the 27bp deletion (not present or very rare in Caucasians) 
are associated with either BMI in the full-heritage Pima Indians or 
mixed-heritage Native Americans suggests that either a rare func- 
tional variant is being captured by different haplotypes in different 
ethnic groups, or it suggests the presence of multiple functional al- 
leles. We are in the process of obtaining whole genomic sequence 
data for Pima Indians to better evaluate the role of ethnic specific 
variants on BMI. 

Haplotype 2, which showed the strongest association with BMI in 
the full-heritage Pima Indians, was the only haplotype associated 
with LPGAT1 gene expression levels in muscle. Although a similar 
trend was observed between haplotype 2 and LPGAT1 expression in 
adipose tissue, the association did not reach significance. The rea- 
sons) for this difference is unclear. It could be an artifact (false 
positive or false negative) due to having a small number of tissue 



samples from individuals homozygous for haplotype 2 (muscle, n = 
10; adipose n = 6, Figure 1). Alternatively, the tissue specific nature 
of the correlation may be driven by valid differences in biology. For 
example, LPGAT1 isoform A has an alternative 5'-UTR that is 
located downstream of the region containing the 27bp deletion (Sup- 
plementary Figure SI online and Figure 4), therefore differences in 
expression between the isoforms in different tissues may obscure an 
association with haplotype 2 and LPGAT1 expression levels in 
adipose. 

The 27bp deletion identified in the 5'-UTR (exon 1) of isoform B 
removes a predicted E2F (transcription factor important for adipo- 
genesis) binding site, but no significant difference in transcriptional 
regulation was observed between the LPGAT1-4™ 1 and LPGAT1- 
4— 27bp isoform A and isoform B promoter constructs in the four cell 
lines examined in this study. However when the 5'-UTRs of 
LPGAT1 isoform B and LPGATl.cApr07 containing the 27bp dele- 
tion were cloned as the 5'-UTR for the luciferase gene, the deletion 
resulted in a decrease in luciferase activity compared with the no de- 
letion controls suggesting that the deletion may affect posttranscrip- 
tional regulation. The deletion was also predicted to affect the free 
energies and secondary structures of the LPGAT1 transcripts, and it 
may be these changes that are responsible for the decrease in lucif- 
erase levels either by affecting translation efficiency or mRNA sta- 
bility. Further in vitro transcription/translation and mRNA decay 
experiments need to be done to address those possibilities. 

A recent report demonstrated that LPGAT1 has monoacylglycerol 
acyltransf erase like enzyme activity (16) and MGATs along with 
DGATs are involved in various lipid and energy metabolism related 
functions (4,16). It was observed that LPGAT1 knockdown in the 
liver of db/db mice resulted in a decrease of serum TAG, total cho- 
lesterol, free fatty acids, and body weight, and therefore suggested 
that LPGAT1 may be involved in liver TAG synthesis and secretion 
(16). A second group has also shown that LPGAT1 is localized in 
the endoplasmic reticulum and may be involved in the reacylation 
(remodeling) of LPG back to phosphatidylglycerol (15). Phosphati- 
dylglycerol is a phospholipid with various biological roles including 
serving as a precursor for cardiolipin a major phospholipid found in 
the inner mitochondrial membrane and important for the proper 
function of enzymes involved in mitochondrial energy metabolism 
(15,31,32). It is also thought that remodeling of cardiolipin may 
generate pathological forms of the phospholipid, which may be re- 
sponsible for the mitochondrial dysfunction seen with various dis- 
eases such as obesity and diabetes (32,33). In preadipocyte cDNA 
we observed an alternative exon located between exons 1 and 2 of 
isoform B that matched a partial LPGAT1 transcript shown in the 
AceView gene models track (UCSC Genome Browser), which may 
contain a putative mitochondrial signaling peptide. Yang et al. (15), 
identified and characterized an alternative human LPGAT1 gene 
(NCBI accession BC034621; isoform A, Supplementary Figure SI 
online and Figure 5a) that contains a larger alternative exon 1 and 
they show that this LPGAT1 isoform is localized to the endoplasmic 
reticulum in Cos-7 cells (15). We also screened muscle cDNA iso- 
lated from several Pima Indian subjects (data not shown), and 
observed PCR products corresponding to both the partial AceView 
transcript containing the alternative exon, and to exons 1 and 2 of 
isoform B, which contains the same endoplasmic reticulum signaling 
peptide identified by Yang et al.(15). Based on these observations, it 
is possible that LPG ATI may have multiple subcellular localizations 
(e.g., endoplasmic reticulum and mitochondrial) and therefore could 
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potentially have multiple functions (e.g., LPGAT and/or monoacyl- 
glycerol acyltransferase activities) in different cells or within the 
same cells. Further fractionation and/or immunocytohistochemical 
studies need to be done to verify that LPAGT1 is indeed localized 
to the mitochondria in preadipocytes and other cell types. 

In conclusion, we identified several variants in the LPGAT1 region 
that are nominally but reproducibly associated with obesity in Pima 
Indians and other Native Americans. Functional studies suggest that 
the 27bp deletion in the 5'-UTR could possibly affect LPGAT1 
mRNA stability; however, our association studies suggest that the 
deletion is unlikely to be the only functional polymorphism in this 
region. 
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